A Method for Extracting Text from Stone Inscriptions Using Character Spotting
Abstract
This paper introduces a novel interactive technique for extracting text characters from images of stone inscriptions. It is designed particularly for on-site processing of inscription images acquired at historic palaces, monuments, and temples. The technique builds on several robust character-analytic elements, such as HoG features, vowel diacritics, and location-bounded scan lines. Because the process spots characters and converts the inscribed information into editable text, it can help archaeologists with epigraphy, transliteration, and translation of rock inscriptions, particularly those with heavy degradation, noise, and styles that vary with the mason's origin and the reign. The spotted characters can also be used to build a database for ancient script analysis and related archaeological work. We tested the method on stone inscriptions collected from heritage sites in Karnataka, India, and the results are quite promising. An Android application implementing the proposed method has also been developed to aid epigraphers in studying inscriptions on a tablet or mobile phone.
Similar resources
Century Identification and Recognition of Ancient Tamil Character Recognition
Recognizing ancient Tamil handwritten characters from inscriptions is difficult. Ancient Tamil characters differ from the current century's Tamil characters. This paper concentrates on identifying the century of ancient Tamil characters and converting them into the current century's form using MATLAB. In this paper, a method for recognizing Tamil characters from stone inscriptions, called ...
3d-sutra – Interactive Analysis Tool for a Web-Atlas of Scanned Sutra Inscriptions in China
Buddhist stone inscriptions (8th-12th centuries) are important cultural assets of China that need to be documented, analyzed, interpreted, and visualized from archaeological, art-historical, and text-scientific perspectives. On the one hand, such Buddhist stone inscriptions have to be conserved for future generations; on the other hand, further possibilities for analyzing the data could be enabled when t...
Zone-based Keyword Spotting in Bangla and Devanagari Documents
In this paper we present a word spotting system for text lines in offline Indic scripts such as Bangla (Bengali) and Devanagari. Recently, it was shown that a zone-wise recognition method improves word recognition performance over a conventional full-word recognition system in Indic scripts [29]. Inspired by this idea, we consider the zone segmentation approach and use middle zone information ...
Keyword Spotting from Online Chinese Handwritten Documents using One-versus-All Character Classification Model
In this paper, we propose a method for text-query-based keyword spotting from online Chinese handwritten documents using a character classification model. The similarity between the query word and handwriting is obtained by combining the character classification scores. The classifier is trained by a one-versus-all strategy so that it gives high similarity to the target class and low scores to the oth...